Appendix B 



[al ign] ***** 

options = -align -type=protein -matr ix=blosum -gapdist=8 -maxdiv=40 ~outorder=al igned -gapopen=10 -gapext==0. 2 -pwmatr ix=b!osum -pwgapopen=10 -pwgapext=0. 1 
CLUSTAL W (1.83) Multiple Sequence Alignments 



Sequence type explicitly set to Protein 
Sequence format is Pearson 
Sequence 1: E1-A0209P-2 498 aa 
Sequence 2: E1-A0209P-8 459 aa 
Start of Pairwise alignments 
A I igning. . . 

Sequences (1.2) Aligned. Score: 34 
Start of Multiple Alignment 
There are 1 groups 
Al igning. . . 

Group 1: Delayed 
Sequenced Score: 2988 
A I ignment Score 779 



query, a In 

CLUSTAL W (1.83) multiple sequence alignment 



E1-A0209P-2 
E1-A0209P-8 



«ATVHQKNMSTLKQRKEDFVTGLN6GS I TE I NAVTS I ALVTY I SWNLLKNSNLMPPG 1 SS 
MSYK LEKEAFVSN LTGSSS I ETCGLLL I G I AGNVLWVNMTARN I LPK6N-L 



E1-A0209P-2 
E1-A0209P-8 



E1-A0209P-2 
E1-A0209P-8 



E1-A0209P-2 
E1-A0209P-8 



E1-A0209P-2 
E1-A0209P-8 



E1-A0209P-2 
E1-A0209P-8 



VGY I I DFALNWVALLLS I T I YASEPYLLNTL I LLPGLLAF I YGKFTSSSKPSNP I YNKKK 
GFLVEFFI FCL I PLFV I YVSSKVGVFTLC I ASFLPSFVLHV I SP I NWDVLRRKPGCCLTK 



:. *: : 



:**.:: 



:* 



M I TQRFGLEKKPY I TAYRGGML ILTAIAI LAVDFP I FPRRFAKVETWGTSLMDLGVGSFV 
KNENTFDR-R I AGVTFYRSQMMLVTVTC I LAVDFTLFPRRYAKVETWGTSLMDLGVGSFM 

: *: :* **. *:::*. .****** 

FSNG I VSSRALLKNLSLKSKPSFLKNAFNALKSGGTLLFLGLLRLFFVKNLEYQEHVTEY 
FSSGTVAGR KND I KKPNAFKNVLWNSF I LL I LGFARMFLTKS I NYQEHVSEY 

**.**:.* *. . . *. * . . **:**: *:*:.*.::*****:** 

GVHWNFF I TLSLLPLVLTF I DPVTRMVPRCS I A I F I SC I YEWLLLKDDRTLNFL I LADRN 
GMHWN FFFT L GFMALG VFFFRRS LKKVS YFN LATF I TLLHHGLLVLTP-FGKWALSAPRT 
* : ***** : ** ::.*:*: : *, .:***:::.**: :::**. 

CFFSANREG I FSFLGYCS I FLWGQNTGFYLLGNKPTLNNLYKPSTQDVVAASKKSSTWDY 
N I LAQNREG I ASLPGY I A I YFYGMYTGS WLADRPLMY TRAESWKR 



E1-A0209P-2 WTSVTPLSGLGI WST I FLV I SQLVFQYHPYSVSRRFANLPYTLWVI TYNLLFLTGYCLTD 
E1-A0209P-8 FQRLLFP— LCI LLVLYLVS NFLSVGVSRRLANTPYVANVAFI NMFFLT I Yl L I D 

*** : : ** : : . . **** :****. * * : : *** * * * 



E1-A0209P-2 
E1-A0209P-8 



E1-A0209P-2 
E1-A0209P-8 



Kl FGNSSEYY— KVAEGLES INSNGLFLFLLANVSTGLVNMSMVTI DSSPLKSFLVLLAY 
AYLFPSSVPYGSRVPKLLEDANNNGLLVFL I ANVLTGVVNLSFDTLHSSNAKGLTI MTMY 

** * :* . : ** ****::**:*****;**:*; *:.** * ; * 

CSFI AV i SVFLYRKR I F I KL 
LFI ICYMAHWLAQHGIRFRL 

:*. : : :* : : * : :* 



query, dnd 

(E1-A0209P-2:0. 32789. E1-A0209P-8:0. 32789) ; 



Appendix B 



Eal ign] 

options = -align -type=protein -inatr i x=b losum -gapdist=8 -maxdiv=40 -outorder=al igned -gapopen=10 -gapext^O. 2 -pwmatr ix=blosum -pwgapopen=10 -pwgapext=0. 1 
GLUSTAL W (1.83) Multiple Sequence Alignments 



Sequence type explicitly set to Protein 

Sequence format is Pearson 

Sequence 1: E1-A0209P-2 498 aa 

Sequence 2: E1-A0209P-14 598 aa 

Start of Pairwise alignments 

A I i gn i ng. . . 

Sequences (1:2) Aligned. Score: 33 
Start of Multiple Alignment 
There are 1 groups 
Al igning. . . 

Group 1: Delayed 
Sequenced Score: 3104 
Alignment Score 789 



query, aln 

CLUSTAL W (1.83) multiple sequence alignment 



E1-A0209P-2 MAT VHQKNMST LKQRKEDFVTG LNG6S I TE I NAVTS I ALVTY I SWNLLK 

E1-A0209P-1 4 MGDYKSAKEAFVSDNPGAS I WS I NAVSLVALATYALW I ALSPY I RHGLLNNY 

* * ** **:. * ** .****: :**.** * *. 

E1-A0209P-2 NSNLMPPGISSVQYI I DFALNWVALLLS I T I YASEP 

E1-A0209P-14 L ICVLPLLFGVTIFSTSPLVFTSFLSI I SLAF I TKSQKCFKSVSSPEKPKGQWLDESDSD 

* : :*. **.:*: : : : : . * 



E1-A0209P-2 YLLNTU LIPCLLAFI YGKFTSSSKPSNPI Y 

E1-A0209P-1 4 EEPAEPASAAGSAAVSPVKLLPSQVAFASGSLLSPDPTTSPMSPSSSSASGHEDPLG I MG 

: ***, :** . :. *: 



-PYITAYRGGM 



E1-A0209P-2 -NKKKM I TGRFGLEKK 

E1-A0209P-1 4 VNRRRS LL EGVS LDVPSH I DSKVR I S PVPYLR L.KKSRATKAQW VK EKGRL PFLTVYRAHM 

*:::::..*: *::***.* 



E1-A0209P-2 L I LTA I A I LAVDFP I FPRRFAKVETWGTSLMDLGVGSFVFSNG I VSSRAL 

E1-A0209P-14 MLMTVI CI LAVDFEVFPRWGGKGEOFGTSLMDVGVGSFVFSLGLVSTKSLSPPPPTPTPS 
:::******** :*** .** :******:*********:**:::* 



E1-A0209P-2 — LKNLSLKSKPSFLKNAFNALKSGGTLLFLGLLRLFFVKNLEYGEHVTEY6VHWNFFI 

E1-A0209P-14 SPALNSHI I PLTPSPFTS ILISLRKS I PI LVLGF I RL I MVKGSDYPEH VT E YGVHWNFFF 

*:. . ** : ::*:.. .:*.**::**::**. :**************: 

E1-A0209P-2 TLSLLPLVLTF I DPVTRMVPRCS I Al F I SC I YEWLLLKDDRTLNFL I LADRNCFFSANRE 

E1-A0209P-1 4 TLALVPVLAVG I RPLTQWLRWSVLGV I I SLLHGLWLTYYLGS I VFSFGRSG— I FLANKE 

**:*:*::.**:*:: .:.::**::: * :::*: :***:* 

E1-A0209P-2 G I FSFLGYCSI FLWGGNTGFYLL6NKPTLNNLYKPSTGDVVAASKKSSTWDYWTSVTPLS 

E1-A0209P-1 4 GFSSLPGYLS I FL I GLS I GDHVLR LSLPPRRERVVSETNEEHEGSHFERKKLOL I 

* ■ * ; ** **** *_*;;* , * :* * 

E1-A0209P-2 GLC I WST I FLV I SQLVFQYHPYSVSRRFANLPYTLWV I TYNLLFLTGYCLTDK I FGN 

E1-A0209P-1 4 MEL I GYSLGWWALLGGWIWAGGEVSRRLANAPYVFWVAAYNTTFLLGYLLLTH 1 1 PSPTS 
* ****:** **_ ;**;** ***** :*: 



E1-A0209P-2 — SSEYYKVAECLESINSNGLFLFLLANVSTGLVNMSMVTIDSSPLKSFLVLLAYGSFIA 
El -A0209P-1 4 SQTSPS I LVPPLLDAMNKNGLA I FLAANLLTGLVNVSMKTMYAPAWLSMGVLMLYTLT I S 

:* *. *:::*.*** :****: *****:***; *;**:* *: 



E1-A0209P-2 VISVFLYRKRIFIKL 
E1-A0209P-14 CVGWILKGRRIKI — 
:* :** * 



query, dnd 

(E1-A0209P-2:0. 33032. E1-A0209P-14:0. 33032) ; 



Appendix B 



[align] ************** 

options = -align -type=protein -tnatr i x=b I osum -gapdist=8 -maxdiv=40 -outorder=a I igned -gapopen=10 -gapext=0. 2 -pwmatr ix=blosum -pwgapopen=10 -pwgapext=0. 1 
CLUSTAL W (1.83) Multiple Sequence Alignments 

Sequence type explicitly set to Protein 

Sequence format is Pearson 

Sequence 1: E1-A0209P-8 459 aa 

Sequence 2: E1-A0209P-14 598 aa 

Start of Pairwise alignments 

Al igning. . . 

Sequences (1:2) Aligned. Score: 33 
Start of Multiple Alignment 
There are 1 groups 
A I i gn i ng. . . 

Group 1: Delayed 
Sequenced Score: 2939 
A I ignment Score 741 

query, aln 

GLUSTAL W (1.83) multiple sequence alignment 



E1-A0209P-8 -MSYKLEKEAFVSNLTGSSS I ETGGLLL I G I ACNVLWVNMTARN I LPKGN LGFL 

E1 -A0209P-1 4 MGDYKSAK EAFVSDNPGAS I WS I NAVSLVALATYALW I ALSPY I RHGLLNNYL I CVLPLL 
. ** ******: *:* .:*:.:* .**: * * :* 



E1-A0209P-8 
E1-A0209P-14 



VEFF I FCL I PLFVI YVSSKVGVFTLG I AS 

FGVT I FSTSPLVFTSFLS 1 1 SLAF I TKSQKCFKSVSSPEKPKGGWLDESDSDEEPAEPAS 



E1-A0209P-8 
E1-A0209P-14 



AAGSAAVSPVKLLPSQVAFASGSLLSPDPTTSPMSPSSSSASGHEDPLG I MGVNRRRSLL 



E1-A0209P-8 
E1-A0209P-14 



E1-A0209P-8 
E1-A0209P-14 



PSFVLHVISPINWDVLR RKPGCCLTKKNENTFDRRIAGVTFYRSQMMLVTVTC! 

EGVSLDVPSH I DSKVR I SPVPYLRLKKSRATKAQWVKEKGRLPFLTVYRAHMMLMTV I CI 

..*.***:.* * **: :*.**::***:** ** 

LAVDFTLFPRRYAKVETWGTSLMDLGVGSFMFSSGTVAGRKND I KKPNAFKN 

LAVDFE VFPRWQGKCEOFGTS LMD VG VGS F VFSLGL VSTKS L S PP PPT PT PSS PALN S H I 
***** :*** .* * :******:*****:** **::.. *. . 



E1-A0209P-8 VLWNSFILL I LGFARMFLTKS I NYQEHVSEYGMHWNFFFTLGFMALG 

E1-A0209P-1 4 I PLTPSPFTS I L I SLRKS I PI LVLGFIRL I MVKGSDYPEHVTEYGVHWNFFFTLALVPVL 

* :*: :*:*** *:::.*. :* ***:*= 



E1-A0209P-8 VFFFRRSLKKVSYFNLATF I TLLHHGLLVLTPFGKWALSAPRTN I LAQNREG I ASLPGY I 

E1-A0209P-14 AVG I RPLTQWLRWSVLGVI ISLLHQLWLTYY-LQSIVFSF6RSGI FLANK EG FSSLPGYL 

:* *.:*:***: *. :*..:* *:.*: *:**:: 



E1-AO209P-8 
E1-A0209P-14 



A I YFYGMYTGSVVLADRP— 



-LMYTRAESWKRFQRLL FPLC I LLVLYL VSNFLSV 



S I FL I GLS I GOHVLRLSLPPRRERVVSETNEEHEQSHFERKKLOL I MEL I GYSLGWWALL 

:*;:*: * ** :*:* :*:*:* 



E1-A0209P-8 G VSRRLANTPYVANVAF I NMFFLT I Y I L I DAYLFPSSVPYGSR — VPKLLE 

E1 -A0209P-1 4 GGW I WAGGEVSRRLANAPYVFWVAAYNTTFLLGYLLLTH 1 1 PSPTSSQTSPS I LVPPLLD 
#:*** ***** *.*: * ** **: 



E1 -A0209P-8 DANNNGLLVFL I ANVLTGVVNLSFDTLHSSNAKGLT I MTMYLF 1 1 GYMAHWUQHG I RFR 

E1-A0209P-14 AMNKNGLA! FLAANLLTGLVNVSMKTMYAPAWLSMGVLMLYTLTISCVGWI LKGRRIKl- 

*:*** :** **:***:**:*:.*:;:. :* ; *. * : *: : 



E1-A0209P-8 
E1-A0209P-14 



query, dnd 

(El -A0209P-8 :0. 33007. E1-A0209P-14: 0. 33007) ; 



